On Some Mathematics for Visualizing High Dimensional Data
نویسندگان
چکیده
The analysis of high-dimensional data offers a great challenge to the analyst because the human intuition about geometry of high dimensions fails. We have found that a combination of three basic techniques proves to be extraordinarily effective for visualizing large, high-dimensional data sets. Two important methods for visualizing high-dimensional data involve the parallel coordinate system and the grand tour. Another technique which we have dubbed saturation brushing is the third method. The parallel coordinate system involves methods in high-dimensional Euclidean geometry, projective geometry, and graph theory while the the grand tour involves high-dimensional space filling curves, differential geometry, and fractal geometry. This paper describes a synthesis of these techniques into an approach that helps build the intuition of the analyst. The emphasis in this paper is on the underlying mathematics.
منابع مشابه
Robust high-dimensional semiparametric regression using optimized differencing method applied to the vitamin B2 production data
Background and purpose: By evolving science, knowledge, and technology, we deal with high-dimensional data in which the number of predictors may considerably exceed the sample size. The main problems with high-dimensional data are the estimation of the coefficients and interpretation. For high-dimension problems, classical methods are not reliable because of a large number of predictor variable...
متن کاملVisualizing Fluid Flow Data : From the Canvas
of “Visualizing Fluid Flow Data: From the Canvas to the CAVE” by Robert M. Kirby II, Sc.M., Brown University, May 2001. Scientific visualization not only expands our understanding of fluid flow phenomena by allowing us to examine the evolution of different flow quantities, but also it can be used as a catalyst for the development of mathematical models which describe the time evolution of compl...
متن کاملHigh-Dimensional Unsupervised Active Learning Method
In this work, a hierarchical ensemble of projected clustering algorithm for high-dimensional data is proposed. The basic concept of the algorithm is based on the active learning method (ALM) which is a fuzzy learning scheme, inspired by some behavioral features of human brain functionality. High-dimensional unsupervised active learning method (HUALM) is a clustering algorithm which blurs the da...
متن کاملVisualizing Company Fitness in High-Dimensional Parameter Space
This paper explores the use of Data Mining algorithms to reduce and project internal company parameter data onto a two dimensional landscape which can show the course and trajectory of the company. We show how the resulting landscape can be used for analysis, planning, prediction, and ultimately in some ways, navigation. 1.0 Introduction We will define a company's " state " as aspects of its in...
متن کاملSome properties of band matrix and its application to the numerical solution one-dimensional Bratu's problem
A Class of new methods based on a septic non-polynomial spline function for the numerical solution one-dimensional Bratu's problem are presented. The local truncation errors and the methods of order 2th, 4th, 6th, 8th, 10th, and 12th, are obtained. The inverse of some band matrixes are obtained which are required in proving the convergence analysis of the presented method. Associated boundary f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001